Abstract


The Small World phenomenon has inspired researchers across a number of fields. A breakthrough 
in its understanding was made by Kleinberg who introduced Rank Based Augmentation (RBA): add to 
each vertex independently an arc to a random destination selected from a carefully crafted probability 
distribution. Kleinberg proved that RBA makes many networks navigable, i.e., it allows greedy routing 
to successfully deliver messages between any two vertices in a polylogarithmic number of steps. We 
prove that navigability is an inherent property of many random networks, arising without coordination, 
or even independence assumptions. 


*Research supported by a European Research Council (ERC) Starting Grant (StG-210743) and an Alfred P. Sloan Fellowship.
†Supported in part by an Onassis Foundation Scholarship.





1 Introduction 


The Small World phenomenon refers to the fact that there exist short chains of acquaintances between most 
pairs of people in the world, popularly known as Six Degrees of Separation [17]. Milgram’s famous 1967 
experiment [16] showed that not only do such chains exist, but they can also be found in a decentralized manner.
Specifically, each participant in the experiment was handed a letter addressed to a certain person and was 
told of some general characteristics of the person, including their occupation and location. They were then 
asked to forward the letter to the individual they knew on a first-name basis who was most likely to know 
the recipient. Based on the premise that similar individuals have a higher chance of knowing each other
(homophily), the participants typically forwarded the message to their contact most similar to the target, a 
strategy that yielded remarkably short paths for most letters that reached their target (many did not). 

Kleinberg’s groundbreaking work formulated mathematically the property of finding short paths in a
decentralized manner as navigability [8, 10]. Since then, much progress has been made [9] and the concept
of navigability has found applications in the design of P2P networks [5, 19], data-structures [4, 15] and 
search algorithms [14, 18]. Key to decentralization is shared knowledge in the form of geometry, i.e., shared 
knowledge of a (distance) function on pairs of vertices (not necessarily satisfying the triangle inequality). 

Geometry. A geometry $(V, d)$ consists of a set of vertices $V$ and a distance function $d : V \times V \to \mathbb{R}_+$, where $d(x, y) \ge 0$, $d(x, y) = 0$ iff $x = y$, and $d(x, y) = d(y, x)$, i.e., $d$ must be a semi-metric.

Given a graph $G(V, E)$ on a geometry $(V, d)$, a decentralized search algorithm is any algorithm that, given a target vertex $t$ and current vertex $v$, selects the next edge $\{v, u\} \in E$ to cross by only considering the distance of each neighbor $u$ of $v$ to the target $t$, i.e., $d(u, t)$. The allowance of paths of polylogarithmic length in the definition of navigability, below, is motivated by the fact that in any graph with constant degree the diameter is $\Omega(\log n)$, reflecting an allowance for polynomial loss due to the lack of global information.

Navigability. A graph $G(V, E)$ on geometry $(V, d)$ is $d$-navigable if there exists a decentralized search algorithm which, given any two $s, t \in V$, will find a path from $s$ to $t$ of length $O(\mathrm{poly}(\log |V|))$.


In his original work on navigability [8, 10], Kleinberg showed that if $G$ is the 2-dimensional grid then adding a single random edge independently to each $v \in V$ results in a navigable graph (with $d$ being the $L_1$ distance on the grid). The distribution for selecting the other endpoint $u$ of each added edge is crucial. Indeed, if it can only depend on $d(v, u)$ and distinct vertices are augmented independently, Kleinberg showed that there is a unique suitable distribution, the one in which the probability is proportional to $d(v, u)^{-2}$ (and, more generally, $d(v, u)^{-r}$ for $r$-dimensional lattices). The underlying principle behind Kleinberg’s augmentation scheme has by now become known as Rank Based Augmentation (RBA) [11, 13].


Rank Based Augmentation. Given a geometry $(V, d)$, a vertex $v \in V$, and $\ell > 0$, let $N_v(\ell)$ be the number of vertices within distance $\ell$ from $v$. In RBA, the probability of augmenting $v$ with an edge to any $u \in V$ is

$$P(v, u) \propto \frac{1}{N_v(d(v, u))} \qquad (1)$$


The intuition behind RBA is that navigability is attained because the added edges provide connectivity 
across all distance scales. Concretely, observe that for any partition of the range of the distance function 
d into intervals, the probability that the (distance of the) other endpoint of an added edge lies in a given 
interval is independent of the interval. Therefore, by partitioning the range of $d$ into $O(\log n)$ intervals we
see that whatever the current vertex $v$ is, there is always $\Omega((\log n)^{-1})$ probability that its long-range edge
is to a vertex at a distance at the same “scale” as the target. Of course, that alone is not enough. In order to
shrink the distance to the target by a constant factor, we also need the long-range edge to have reasonable 
probability to go “in the right direction”, something which is effortlessly true in regular lattices for any 
finite dimension. In subsequent work [11], aiming to provide rigorous results for graphs beyond lattices, 
Kleinberg showed that the geometric conditions needed for RBA to achieve navigability are satisfied by the 
geometries induced by set-systems satisfying certain conditions when the distance between two vertices is 
the size of the smallest set (homophily) containing both (see Definition 1 in Section 5). 
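
The "equal mass per distance scale" intuition can be checked numerically on a toy geometry. The following is a minimal sketch, assuming a ring of $n$ vertices with ring distance, scale factor 2, and the convention that $N_v(\ell)$ counts vertices other than $v$; the function names are illustrative and not taken from the paper.

```python
def ring_dist(a, b, n):
    """Distance between a and b on a ring of n vertices."""
    diff = abs(a - b)
    return min(diff, n - diff)


def rba_distribution(n, v):
    """RBA probabilities P(v, u), proportional to 1 / N_v(d(v, u))."""
    weights = {}
    for u in range(n):
        if u == v:
            continue
        ell = ring_dist(v, u, n)
        # Assumption: N_v(ell) counts vertices other than v within distance ell.
        rank = min(2 * ell, n - 1)
        weights[u] = 1.0 / rank
    total = sum(weights.values())
    return {u: w / total for u, w in weights.items()}


def mass_per_scale(n, v):
    """Total RBA probability landing in each dyadic scale (2^(k-1), 2^k]."""
    masses = {}
    for u, p in rba_distribution(n, v).items():
        # For an integer distance d >= 1, (d - 1).bit_length() = ceil(log2(d)).
        k = (ring_dist(v, u, n) - 1).bit_length()
        masses[k] = masses.get(k, 0.0) + p
    return masses
```

On this toy ring every dyadic scale receives probability mass within a constant factor of every other scale, which is the property the argument above relies on.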

Another canonical setting for achieving navigability by RBA is when the distance function d is the 
shortest-path metric of a connected graph $G_0(V, E_0)$ with large diameter $\Theta(\mathrm{poly}(n))$. In that setting, if $E_d$ is the random set of edges added through RBA, the question is whether the (random) graph $G(V, E_0 \cup E_d)$ is $d$-navigable. Works of Slivkins [15] and Fraigniaud et al. [7] have shown the existence of a threshold, below which navigability is attainable and above which (in the worst case) it is not attainable, in terms of the doubling dimension of the shortest path metric of $G_0$. Roughly speaking, the doubling dimension corresponds to the logarithm of the possible directions that one might need to search, and the threshold occurs when it crosses $\Theta(\log \log n)$. Thus, we see that even when $d$ is a (shortest path) metric, very significant additional
constraints on d need to be imposed. 

The remarkable success of RBA in conferring navigability rests crucially on its perfect adaptation to the 
underlying geometry. This adaptation, though, not only requires perfect independence and identical behavior 
of all vertices, but also a very specific, indeed unique, functional form for the probability distribution of edge 
formation. This exact fine-tuning renders RBA unnatural, greatly weakening its plausibility. Our goal in this
paper is to demonstrate that navigability is in fact a robust property of networks that emerges from very 
basic considerations without adaptation, coordination, or even independence assumptions. 

2 Our Contribution 

As mentioned, at the foundation of navigability lies shared knowledge in the form of geometry. Our starting 
premise is that geometry imposes global constraints on the set of feasible networks. Most obviously, in a 
physical network where edges (wire, roads) correspond to a resource (copper, concrete) there is typically 
an upper bound on how much can be invested to create the network. More generally, cost may represent 
a number of different notions that distinguish between edges. We formalize this intuition by (i) allowing 
different edges to have arbitrary costs, i.e., without imposing any constraints on the cost structure, and (ii) 
taking as input an upper bound on the total cost of feasible graphs, i.e., a budget. We remain fully agnostic 
in all other respects, i.e., we study the uniform measure on all graphs satisfying the budget constraint. So, 
for example, if all edges have unit cost we recover the classic Erdős–Rényi $G(n, m)$ random graphs (except
now $m$ is a random variable, sharply concentrated just below the budget).

As one can imagine, the set of all graphs feasible within a given budget may contain wildly different 
elements. Our capacity to study the uniform measure on such graphs comes from a very recent general theorem we developed in [1], of which this work is the first application. At a high level, the main theorem of [1] asserts that if a subset $S$ of the set of all undirected simple graphs $\mathcal{G}_n$ on $n$ vertices is sufficiently symmetric, then the uniform measure on $S$ can be well-approximated by a product measure on the edges, i.e., a measure where each edge is included independently with different edges potentially having different probabilities. Formally, a product measure on $\mathcal{G}_n$ is specified succinctly by a symmetric matrix $Q \in [0, 1]^{n \times n}$ of probabilities where $Q_{ii} = 0$ for $i \in [n]$. We denote by $G(n, Q)$ the measure in which every possible edge $\{i, j\}$ is included independently with probability $Q_{ij} = Q_{ji}$. The main result of [1] then allows one to approximate the uniform measure by a product measure in the following very strong sense.




Sandwichability. The uniform measure $U(S)$ over an arbitrary set of graphs $S \subseteq \mathcal{G}_n$ is $(\epsilon, \delta)$-sandwichable if there exists an $n \times n$ symmetric matrix $Q$ such that the two distributions $G^{\pm} \sim G(n, (1 \pm \epsilon)Q)$ and the distribution $G \sim U(S)$ can be coupled so that $G^- \subseteq G \subseteq G^+$ with probability at least $1 - \delta$.
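
To make the two outer layers of the sandwich concrete, here is a small sketch of how the lower and upper product-measure graphs can be coupled by reusing one uniform random number per potential edge. It illustrates only the trivial containment between the two product measures; the substantive content of sandwichability, namely that a uniform sample from $S$ fits between them, is what the theorem of [1] supplies and is not simulated here. The function name and the clipping of probabilities at 1 are assumptions made for the sketch.

```python
import random


def coupled_product_graphs(Q, eps, rng):
    """Sample G^- ~ G(n, (1 - eps) Q) and G^+ ~ G(n, (1 + eps) Q) on shared randomness."""
    n = len(Q)
    lower, upper = set(), set()
    for i in range(n):
        for j in range(i + 1, n):
            x = rng.random()  # one shared uniform per potential edge {i, j}
            if x < (1 - eps) * Q[i][j]:
                lower.add((i, j))
            # Assumption: probabilities above 1 are clipped to 1.
            if x < min(1.0, (1 + eps) * Q[i][j]):
                upper.add((i, j))
    return lower, upper
```

Because the same uniform is compared against two nested thresholds, every edge of the lower graph is also an edge of the upper graph.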

As discussed above, navigability requires some degree of structure in the underlying geometry. It is from this structure that we will extract the symmetry needed to apply the theorem of [1] and derive a product form approximation for graphs with a bounded total cost. Armed with such an approximation, establishing navigability becomes dramatically easier, allowing us to demonstrate its robustness and ubiquity. Roughly speaking, we isolate three ingredients that suffice for navigability on a geometry $(V, d)$:

• A substrate of connections between nearby points on V, allowing the walk to never get stuck. 

• Some degree of coherence of the distance function d. 

• Sufficient, and sufficiently uniform, edge density across all distance scales. 

The first two ingredients are generalizations of existing work and, as we will see, fully compatible with 
RBA. The third ingredient is also motivated by the RBA viewpoint, but we will prove that it can be achieved
in a far more light-handed, and thus natural, manner than RBA. Moreover, in the course of doing so, we will
give it a very natural economic interpretation as the cost of indexing.

2.1 The two Basic Requirements and a Unifying Framework for RBA 

Substrate. A set of edges $E_0$ forms a substrate for a geometry $(V, d)$, if for every $(s, t) \in V \times V$ with $s \ne t$, there is at least one vertex $v$ such that $\{s, v\} \in E_0$ and $d(v, t) \le d(s, t) - 1$.

The existence of the substrate implies that (very slow) travel between any two vertices is possible, so 
that a decentralized algorithm never gets trivially stuck. 
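
A decentralized search over a substrate plus long-range edges can be sketched as follows. This is an illustrative toy, assuming a ring of $n$ vertices whose nearest-neighbor edges play the role of the substrate, and one RBA-style long-range edge per vertex with $N_v(\ell)$ taken as the number of other vertices within distance $\ell$; none of the names below come from the paper.

```python
import random


def ring_dist(a, b, n):
    """Distance between a and b on a ring of n vertices."""
    diff = abs(a - b)
    return min(diff, n - diff)


def build_graph(n, rng):
    """Ring substrate plus one RBA-style long-range edge per vertex."""
    # Substrate: each vertex has a neighbor one step closer to any target.
    adj = {v: {(v - 1) % n, (v + 1) % n} for v in range(n)}
    for v in range(n):
        others = [u for u in range(n) if u != v]
        weights = [1.0 / min(2 * ring_dist(v, u, n), n - 1) for u in others]
        u = rng.choices(others, weights=weights, k=1)[0]
        adj[v].add(u)
        adj[u].add(v)
    return adj


def greedy_route(adj, s, t, n):
    """Greedy routing: always cross to the neighbor closest to the target t."""
    path = [s]
    v = s
    while v != t:
        # Only distances from neighbors to t are consulted (decentralized search).
        v = min(adj[v], key=lambda u: ring_dist(u, t, n))
        path.append(v)
    return path
```

The loop terminates because the substrate always offers a neighbor that is strictly closer to the target, so the greedy choice reduces the distance at every step.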

Coherence is a notion that comes with an associated scale factor $\gamma > 1$. Specifically, given a geometry $(V, d)$ we will refer to the vertices whose distance from a given vertex $v \in V$ lies in the interval $(\gamma^{k-1}, \gamma^k]$ as the vertices in the $k$-th (distance) $\gamma$-scale from $v$. Also, for a fixed $\lambda < 1$ and any target vertex $t \ne v$, we will say that a vertex $u$ is $t$-helpful to $v$ if $d(v, u) \le \gamma^k$, where $k$ is the $\gamma$-scale of $t$ from $v$ ($u$ is within the same $\gamma$-scale as $t$), and $d(u, t) \le \lambda\, d(v, t)$ (reduces the distance by a constant).

Coherence. Let $K = \lceil \log_\gamma |V| \rceil$. A geometry $(V, d)$ is $\gamma$-coherent if there is $\lambda < 1$ such that for all $v \in V$:

- For all $k \in [K]$, the number of vertices in the $k$-th distance scale from $v$ is $P_k(v) = \Theta(\gamma^k)$.

- For all $t \ne v$, a constant fraction of the vertices whose distance scale from $v$ is no greater than the distance scale of $t$ are $t$-helpful to $v$.

The two conditions above endow the, otherwise arbitrary, semi-metric d with sufficient regularity and 
consistency to guide the search. Although our definition of coherence is far more general, in order to convey 
intuition about the two conditions, think for a moment of V as a set of points in Euclidean space. The first 
condition guarantees that there are no “holes”, as the variance in the density of points is bounded in every 
distance scale. The second condition guarantees that around any vertex v the density of points does not 
change much depending on the direction (target vertex t) and distance scale. Besides those two conditions, 
we make no further assumptions on d and, in particular, we do not assume the triangle inequality. 
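
The first coherence condition is easy to verify on a toy example. The sketch below, assuming a ring of $n$ vertices with ring distance and $\gamma = 2$ (choices made here purely for illustration), counts the vertices in each $\gamma$-scale around a vertex $v$.

```python
def ring_dist(a, b, n):
    """Distance between a and b on a ring of n vertices."""
    diff = abs(a - b)
    return min(diff, n - diff)


def scale_counts(n, v, gamma=2):
    """Number of vertices whose distance from v lies in (gamma^(k-1), gamma^k]."""
    counts = {}
    for u in range(n):
        if u == v:
            continue
        d = ring_dist(v, u, n)
        k = 0
        while gamma ** k < d:  # smallest k with gamma^k >= d
            k += 1
        counts[k] = counts.get(k, 0) + 1
    return counts
```

On a ring with an odd number of vertices each distance value is attained by exactly two vertices, so the count in scale $k \ge 1$ is exactly $2^k$, a particularly clean instance of $P_k(v) = \Theta(\gamma^k)$.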

Coherent geometries allow us to provide a unified treatment of navigability since they encompass finite-dimensional lattices, hierarchical models, any vertex transitive graph with bounded doubling dimension and, as we prove in Section 5, Kleinberg’s set systems [11] (see Definition 1).

Theorem 1. Every set system satisfying the conditions of [11] is a $\gamma$-coherent geometry for some $\gamma > 1$.

Theorem 2. Let $(V, d)$ be any $\gamma$-coherent geometry and let $E_0$ be any substrate for it. If $E_d$ is the (random) set of edges obtained by applying RBA to $(V, d)$, then the graph $G(V, E_0 \cup E_d)$ is $d$-navigable w.h.p.¹

Theorem 2 subsumes and unifies a number of previous positive results on RBA-induced navigability. 
Our main contribution, though, lies in showing that given a substrate and coherence, navigability can emerge 
without coordination from the interplay of cost and geometry. 

2.2 Navigability from Organic Growth 

As mentioned earlier, the success of RBA stems from the fact that the edge-creation mechanism is perfectly 
adapted to the underlying geometry so as to induce navigability. In contrast, we will not even specify an 
edge-creation mechanism, but rather focus only on the set of graphs feasible with a given budget. Our 
requirement is merely that the cost function is informed by the geometry. 

$\gamma$-consistency. Given a $\gamma$-coherent geometry $(V, d)$, a cost function $c : V \times V \to \mathbb{R}$ is $\gamma$-consistent if $c$ takes the same value $c_k$ for every edge $\{u, v\}$ such that $d(u, v) \in (\gamma^{k-1}, \gamma^k]$.

In particular, note that we make no requirements of the values $\{c_k\}$, not even a rudimentary one, such as being increasing in $k$. All that $\gamma$-consistency entails is that the partition of edges according to cost is a coarsening of the partition of the edges by $\gamma$-scale. In fact, even this requirement can be weakened significantly, as long as some correlation between the two partitions remains, but it is technically much simpler to assume $\gamma$-consistency as it simplifies the exposition greatly. One can think of consistency as limited sensitivity with respect to distance. As an example, it means that making friends with the people next door might be more likely than making friends with other people on the same floor, and that making friends with people on the same floor is more likely than making friends with people on a different floor, but it does not really matter which floor.

Cost-geometries. We say that $\Gamma = \Gamma(V, d, c)$ is a coherent cost-geometry if there exists $\gamma > 1$ such that $(V, d)$ is a $\gamma$-coherent geometry and $c$ is a $\gamma$-consistent cost function.

Random Graphs of Bounded Cost. Given a coherent cost-geometry $\Gamma(V, d, c)$ and a real number $B > 0$, let $G_\Gamma(B) = \{E \subseteq V \times V : \frac{1}{n} \sum_{e \in E} c(e) \le B\}$, i.e., $G_\Gamma(B)$ is the set of all graphs (edge sets) on $V$ with total cost at most $Bn$. A uniformly random element of $G_\Gamma(B)$ will be denoted as $E_\Gamma = E_\Gamma(B)$.
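
For very small vertex sets the uniform measure on bounded-cost graphs can be examined by brute force. The sketch below enumerates every edge set on $n$ vertices, keeps those with total cost at most $Bn$, and reports the marginal probability of each edge under the uniform measure. The two-level cost function `ring_cost` is a hypothetical example of a cost that depends only on distance scale; it is not taken from the paper.

```python
from itertools import combinations


def ring_cost(u, v, n=5):
    """Hypothetical cost on a 5-ring: 1 for adjacent vertices, 3 otherwise."""
    d = min(abs(u - v), n - abs(u - v))
    return 1 if d == 1 else 3


def bounded_cost_graphs(n, cost, B):
    """All edge sets on n vertices whose total cost is at most B * n."""
    edges = list(combinations(range(n), 2))
    feasible = []
    for mask in range(1 << len(edges)):
        E = [e for i, e in enumerate(edges) if (mask >> i) & 1]
        if sum(cost(u, v) for u, v in E) <= B * n:
            feasible.append(frozenset(E))
    return edges, feasible


def edge_marginals(edges, feasible):
    """Probability that each edge appears in a uniformly random feasible graph."""
    return {e: sum(e in E for E in feasible) / len(feasible) for e in edges}
```

In this toy instance edges with the same cost receive the same marginal probability and cheaper edges are more likely than expensive ones, which is qualitatively what a product-measure approximation with cost-dependent edge probabilities would predict. The enumeration is exponential in the number of potential edges and is meant only for tiny examples.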

Applying the main theorem of [1], in Section 3 we will prove that random graphs of bounded cost (on a 
consistent cost-geometry) have a product measure approximation, in the following sense. 

Theorem 3. Given a coherent cost-geometry $\Gamma$, there exist a unique function $\lambda(B) \ge 0$ and a constant $B_0(\Gamma) > 0$ such that for every $B \ge B_0(\Gamma)$ the uniform measure on $G_\Gamma(B)$ is $(\epsilon, \delta)$-sandwichable by the product measure in which the probability of every edge with cost $c_k$ is

$$\frac{1}{1 + \exp(\lambda(B)\, c_k)} \qquad (2)$$

and $(\epsilon, \delta) = \left(\sqrt{24 / \log |V|},\ 2|V|^{-5}\right)$. The number $\lambda(B) \ge 0$ can be explicitly defined in terms of $\{c_k\}$.

¹Throughout the paper, all asymptotics will be with respect to the number of vertices $|V| = n$. Thus, with high probability will
always mean “with probability that tends to 1 as $n \to \infty$.”
The regularizer $\lambda = \lambda(B)$ in Theorem 3 corresponds to the derivative of entropy with respect to the
budget $B$ (energy), i.e., is an inverse temperature, and depends on $B$ in a smooth one-to-one manner.

Theorem 3 will give us a great amount of access to the uniform measure on $G_\Gamma(B)$. In particular, the upper approximation $G(|V|, (1 + \epsilon)Q)$ will allow us to bound the total number of edges present in a typical graph, establishing sparsity for all sufficiently small budgets. On the other hand, the lower approximation $G(|V|, (1 - \epsilon)Q)$ will allow us to establish a lower bound on the number of edges incident to each vertex of each distance scale. Combined with the spatial uniformity afforded by independence, this will allow us to prove that navigability emerges as soon as the total number of edges within each scale is large enough, establishing navigability for all sufficiently large budgets.

Theorem 4. For every coherent cost-geometry $\Gamma(V, d, c)$, where $|V| = n$, there exist numbers $B^{\pm}$ such that if $E_\Gamma$ is a uniformly random element of $G_\Gamma(B)$ then:

- For all $B \le B^+$, w.h.p. $|E_\Gamma| = O(n \cdot \mathrm{poly}(\log n))$. (Sparsity)

- For all $B \ge B^-$, for any substrate $E_0$, w.h.p. the graph $G(V, E_0 \cup E_\Gamma)$ is $d$-navigable. (Navigability)

Note that Theorem 4 shows that navigability arises eventually, i.e., for all $B \ge B^-$, without any further assumptions on the cost function or geometry. The caveat, if we think of $B$ as increasing from 0, is that by the time there are enough edges across all distance scales, i.e., $B \ge B^-$, the total number of edges may be much greater than linear. This is because for an arbitrary cost structure $\{c_k\}$, by the time the “slowest growing” distance scale has the required number of edges, the other scales may be replete with edges, possibly many more than the required $\Omega(n / \mathrm{poly} \log(n))$. This is reflected in the ordering between $B^-$ and $B^+$ that determines whether the sparsity and navigability regimes are overlapping. In particular, we would like $B^- < B^+$ and, ideally, the ratio $R = B^+ / B^- > 0$ to be large. Whether this is the case or not depends precisely on the degree of adaptation of the cost-structure $\{c_k\}$ to the geometry as we discuss next.

2.3 Navigability as a Reflection of the Cost of Indexing 

Recall that for every vertex $v$ in a $\gamma$-coherent geometry and for every distance scale $k \in [K]$, the number of vertices whose distance from $v$ is in the $k$-th $\gamma$-distance scale is $P_k(v) = \Theta(\gamma^k)$. At the same time, (2) asserts that the probability of each edge is exponentially small in its cost. Thus, reconciling sparsity with navigability boils down to balancing these two factors. We will exhibit a class of cost functions that (i) have an intuitive interpretation as the cost of indexing, (ii) achieve a ratio $R = B^+ / B^- > 0$ that grows with $n$, i.e., a very wide range of budgets for which we have both navigability and sparsity, and (iii) recover RBA as a special case corresponding to a particular budget choice.

Consider a vertex $v$ that needs to forward a message to a neighbor $u$ at the $k$-th distance scale. To do so, $v$ needs to distinguish $u$ among all other $P_k(v)$ vertices at the $k$-th distance scale, i.e., $v$ needs to be able to index into that scale. It is therefore natural to assert that $v$ must incur a cost of $\Theta(\log_2 P_k(v)) = \Theta(k)$ bits (due to coherence) to store the unique ID of $u$ among the other members of its equivalence class (in the eyes of $v$). Motivated by this we consider cost functions where for some $\alpha > 0$,



Theorem 5. For any coherent cost-geometry $\Gamma(V, d, c^*)$, where $|V| = n$, there exist $B^- < B^+$ such that:

1. $B^+ / B^- = \omega(\mathrm{poly} \log n)$.

2. For all $B \in [B^-, B^+]$, w.h.p.:

• $|E_\Gamma(B)| = O(n\, \mathrm{poly} \log n)$.

• The graph $G(V, E_0 \cup E_\Gamma(B))$ is $d$-navigable.

3. There exists $B_\alpha \in [B^-, B^+]$ such that in the approximation of $E_\Gamma$ by $G(|V|, Q^*)$, for every $\{u, v\} \in E$, $Q^*_{uv} = \Theta(N_u(d(u, v))^{-1})$, i.e., Rank Based Augmentation is approximately recovered.

Part 1 of Theorem 5 is equivalent to a scaling window of Q(^jrf^) for the exponent A, within which 
navigability holds with poly-logarithmic average degree. This corroborates Kleinberg’s work that gave a unique exponent of $\beta = -1$ in the context of RBA for the scaling (1) of probability. Nevertheless, under our framework this vanishing window for the highly sensitive parameter $\lambda$ produces a diverging range for values of $B$, explaining the purported fragility of RBA as a consequence of looking at perturbations in the wrong scale. In fact, we can use this feature of our model to provide the first theoretical explanation for the discrepancy between theoretical results and empirical evidence [2, 13, 6] showing that real networks exhibit an exponent $\beta \approx 0.8 < 1$. In our setting, exponents smaller than 1 correspond to higher average degree and thus we can attribute this discrepancy to finite size effects (finite $n$) and the densification [12] of networks.

3 Deriving a Product Measure Approximation: Proof of Theorem 3 

We start with some definitions that will allow us to state the main theorem of [1]. A set of graphs $S \subseteq \mathcal{G}_n$ is symmetric with respect to a partition $\mathcal{P}$ of the set of all possible $\binom{n}{2}$ edges, if the characteristic function of $S$ depends only on the number of edges from each part of $\mathcal{P}$ but not on which edges.

Edge Profile. Given a partition $\mathcal{P} = (\mathcal{P}_1, \ldots, \mathcal{P}_K)$ of the set of all possible $\binom{n}{2}$ edges, for a set of edges $E \in \mathcal{G}_n$ and for each $k \in [K]$, let $m_k(E)$ denote the number of edges in $E$ from $\mathcal{P}_k$. The edge profile of $E$ is $m(E) := (m_1(E), \ldots, m_K(E))$.

We denote the image of a symmetric set $S$ under the edge-profile as $m(S)$. As before let $P_k = |\mathcal{P}_k| = \frac{1}{2} \sum_{u \in V} P_k(u)$ be the total number of edges in part $k$ of partition $\mathcal{P}$.

Edge Profile Entropy. Given an edge profile $v = (v_1, \ldots, v_K)$ the entropy of $v$ is $\mathrm{Ent}(v) = \sum_{k=1}^{K} \log \binom{P_k}{v_k}$.

The edge-profile entropy is used to express the number of graphs with a particular edge profile $v$ as $\exp(\mathrm{Ent}(v))$. Given any symmetric set $S \subseteq \mathcal{G}_n$, the probability of observing an edge profile $v$ when sampling an element uniformly at random from $S$ is then given by $P_S(v) = \frac{1}{|S|} e^{\mathrm{Ent}(v)}$. Thus, in order to analyze the distribution of a random edge-profile, and consequently of a random element of $G_\Gamma(B)$, we are going to exploit analytic properties of the entropy on the set of feasible edge profiles $m(S)$.
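
The counting identity behind the edge-profile entropy can be checked directly. The following minimal sketch computes $\mathrm{Ent}(v)$ as a sum of log-binomial coefficients, using `math.lgamma` for numerical stability; the helper names are illustrative.

```python
import math


def log_binom(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)


def edge_profile_entropy(part_sizes, profile):
    """Ent(v) = sum over parts k of log C(P_k, v_k)."""
    return sum(log_binom(P, v) for P, v in zip(part_sizes, profile))
```

For example, with two parts of three edges each and profile $(1, 2)$ there are $\binom{3}{1}\binom{3}{2} = 9$ edge sets, and exponentiating the entropy recovers that count up to floating-point error.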

Convexity. Let $\mathrm{Conv}(A)$ denote the convex hull of a set $A$. Say that a $\mathcal{P}$-symmetric set $S \subseteq \mathcal{G}_n$ is convex iff the convex hull of $m(S)$ contains no new integer points, i.e., if $\mathrm{Conv}(m(S)) \cap \mathbb{N}^K = m(S)$.

Entropic Optimizer. Given a symmetric set $S$, let $m^* = m^*(S) \in \mathbb{R}^K$ be the unique solution to



( 3 ) 
Given the maximizer $m^*(S)$, the matrix $Q^* = Q^*(S)$ is given by letting for all $k \in [K]$ the probability of an edge $e \in \mathcal{P}_k$ be $Q^*_e := m^*_k / P_k$. To state the theorem, we need the following parameters that quantify the concentration of the uniform measure around its mode.


Thickness and Condition Number. Given a partition $\mathcal{P}$ and a $\mathcal{P}$-symmetric set $S$, we define

Thickness: $\mu = \mu(S) = \min_{k \in [K]} \min\{m^*_k,\ P_k - m^*_k\}$ (4)

Condition number: $\tau = \tau(S) = \frac{5 K \log n}{\mu(S)}$ (5)

We now state the main theorem employed in the proof.


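Theorem 6 ([1]). Let $\mathcal{P}$ be any edge-partition and let $S$ be any $\mathcal{P}$-symmetric convex set. For every $\epsilon > \sqrt{12\, \tau(S)}$, the uniform measure over $S$ is $(\epsilon, \delta)$-sandwichable for $\delta = 2 \exp\left[-\mu(S)\left(\frac{\epsilon^2}{12} - \tau(S)\right)\right]$.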

In our setting, $S$ is the set $G_\Gamma(B) := \{E \subseteq V \times V : \frac{1}{n} \sum_{e \in E} c_e \le B\}$ of graphs with bounded average cost and $\mathcal{P}$ is the partition induced by the coherent cost function $c$. The set $m(S)$ is then given by $m(S) = \{v \in \mathbb{N}^K : \frac{1}{n} \sum_{k=1}^{K} c_k v_k \le B\}$. Hence, it is easy to see that $G_\Gamma(B)$ is convex and symmetric, according to the previous definition, for all values of $B$. To prove Theorem 3, we need to find:



(i) an analytic expression for the vector $m^*$ as a function of $B$;

(ii) the range of values of $B$ for which applying Theorem 6 gives high probability bounds.


3.1 Finding the Entropic Optimal Edge Profile 

We start by introducing a slight reparametrization in terms of the average-degree profile. For an edge set $E$, define the vector $a(E) := m(E)/n$, where as before $m$ is the edge-profile. In the same spirit, let $p_k = P_k / n$ denote the average number of edges in part (scale) $k$. Using this parametrization and by explicitly writing $\mathrm{Conv}(a(S))$, we can equivalently express the optimization problem (3) as:

$$\max_{a}\ H(a) = -\sum_{k=1}^{K} \left[ (p_k - a_k) \log(p_k - a_k) + a_k \log(a_k) \right]$$

$$\text{subject to} \quad \sum_{k=1}^{K} a_k c_k \le B, \qquad 0 \le a_k \le p_k, \ \forall k \in [K].$$

We will refer to the above optimization problem as (A) and to its solution as $a^* = a^*(B)$. Towards obtaining an analytic expression for $a^*$, we first show that no coordinate $k \in [K]$ lies on the natural boundary $\{0, p_k\}$.

Lemma 1. The optimal profile $a^* \in \mathcal{D}(B) := \{a \in (0, p_1) \times \cdots \times (0, p_K) : \sum_k c_k a_k \le B\}$.

Proof. We prove the lemma by contradiction. We show that if $a^*$ is a solution of (A) such that $a^* \notin \mathcal{D}$, then there is an $\tilde{a}^* \in \mathcal{D}$ for which the objective function $H$ takes a higher value. Specifically, for $\epsilon > 0$ assume that there are indices $1 \le i, j \le K$ such that $a^*_i = 0$ and $a^*_j > \delta(\epsilon)$,² where $\delta(\epsilon) = \epsilon c_i / c_j$. Define $\tilde{a}^*(\epsilon) = (a^*_1, \ldots, a^*_i + \epsilon, \ldots, a^*_j - \delta(\epsilon), \ldots, a^*_K)$. If $h(\epsilon) = H(\tilde{a}^*) - H(a^*)$ is the difference in the objective function between the perturbation $\tilde{a}^*$ and the assumed optimal $a^*$, then

$$h'(\epsilon) = -\log(\epsilon) + \log(p_i - a^*_i - \epsilon) + \frac{c_i}{c_j} \left( \log(a^*_j - \delta(\epsilon)) - \log(p_j - a^*_j + \delta(\epsilon)) \right).$$

Observe that $\lim_{\epsilon \to 0} h'(\epsilon) = +\infty$, since we have assumed that $a^*_j > 0$. This shows that every maximizer satisfies $a^*_k > 0$ for all $k \in [K]$. The same argument establishes that $a^*_k < p_k$ for all $k \in [K]$. Combining the two statements we get that any maximizer belongs in $\mathcal{D}$. □

²For any nontrivial values of $B$ such an index can always be found.


As a consequence, since they are inactive at the optimum, we can omit the separable inequalities $0 \le a_k \le p_k$ from the formulation. Further, define $\bar{B} := \frac{1}{2} \sum_{k=1}^{K} p_k c_k$, the average cost of the solution to the unconstrained version of (A), i.e., where $\bar{a}_k := p_k / 2$. If $B \ge \bar{B}$ then the absolute maximum entropic point $\bar{a}$ is still in $\mathcal{D}(B)$ and thus the solution will always be $a^*_k = \bar{a}_k$ for every such $B$.

Lemma 2. There is a unique function $\lambda(B)$ that is one-to-one for all $0 < B < \bar{B}$ and $\lambda(B) = 0$ for all $B \ge \bar{B}$, such that the unique solution of (A) is given by:

$$a^*_k(B) = \frac{p_k}{1 + \exp[\lambda(B) \cdot c_k]}, \qquad \forall k \in [K] \qquad (6)$$


Proof. Uniqueness of the solution follows easily from convexity of the domain and concavity of the objective function. Further, by Lemma 1, we can reduce the optimization problem (A) to the following:

$$\max_{a}\ -\sum_{k=1}^{K} \left[ (p_k - a_k) \log(p_k - a_k) + a_k \log(a_k) \right] \quad \text{subject to} \quad \sum_{k=1}^{K} a_k c_k \le B.$$

To obtain an analytical solution, we form the Lagrangian of the reduced problem

$$L(a, \lambda) = -\sum_{k=1}^{K} \left[ (p_k - a_k) \log(p_k - a_k) + a_k \log(a_k) \right] + \lambda \left( B - \sum_{k=1}^{K} a_k c_k \right),$$

with the additional constraint that $\lambda \ge 0$. The Karush-Kuhn-Tucker conditions read

$$\frac{\partial L}{\partial a_k} = 0 \iff \log \frac{p_k - a_k}{a_k} - \lambda c_k = 0, \qquad (7)$$

$$\frac{\partial L}{\partial \lambda} = 0 \iff \sum_{k=1}^{K} a_k c_k = B. \qquad (8)$$

Solving the first equation for $a_k(\lambda)$ we get

$$a^*_k = \frac{p_k}{1 + \exp(\lambda c_k)}.$$

Substituting this expression in (8), we get the following function of $\lambda$:

$$g(\lambda) = \sum_{k=1}^{K} \frac{p_k c_k}{1 + \exp(\lambda c_k)} \qquad (9)$$
and the second constraint can now be written as $g(\lambda) = B$. The domain of $g$ is the set of non-negative numbers, on which $g$ is continuous and infinitely differentiable. Under positive costs $\{c_k\}$, it is easy to see that $g'(\lambda) < 0$ for all $\lambda \ge 0$, hence $g$ is strictly decreasing in the interval $[0, \infty)$ and $g(0) = \bar{B}$. Thus, $g : [0, \infty) \to (0, \bar{B}]$ is 1-to-1 and thus invertible. This means that every budget in $(0, \bar{B}]$ is feasible and that for each such budget there is a unique $\lambda(B) := g^{-1}(B)$. For $B \ge \bar{B}$, $\lambda(B) = 0$. Therefore, we conclude that the maximizer is always unique for any feasible $B$ and implicitly given by $g(\lambda) = B$. □
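
Since $g$ is continuous and strictly decreasing, $\lambda(B)$ can be computed numerically by bisection. The sketch below is one possible way to do this, assuming positive costs and using the form $g(\lambda) = \sum_k p_k c_k / (1 + \exp(\lambda c_k))$; the function names and the fixed number of bisection steps are choices made for illustration.

```python
import math


def occupancy(x):
    """Stable evaluation of 1 / (1 + exp(x)) for x >= 0."""
    e = math.exp(-x)
    return e / (1.0 + e)


def g(lam, p, c):
    """Average cost of the entropic optimizer at inverse temperature lam."""
    return sum(pk * ck * occupancy(lam * ck) for pk, ck in zip(p, c))


def solve_lambda(B, p, c):
    """Return lambda(B): zero above g(0), otherwise the root of g(lam) = B."""
    if B <= 0:
        raise ValueError("budget must be positive")
    if B >= g(0.0, p, c):
        return 0.0
    lo, hi = 0.0, 1.0
    while g(hi, p, c) > B:  # g is decreasing, so grow hi until it brackets B
        hi *= 2.0
    for _ in range(200):  # plain bisection on the bracket [lo, hi]
        mid = 0.5 * (lo + hi)
        if g(mid, p, c) > B:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def optimal_profile(B, p, c):
    """Entropic optimizer a*_k = p_k / (1 + exp(lambda(B) * c_k))."""
    lam = solve_lambda(B, p, c)
    return [pk * occupancy(lam * ck) for pk, ck in zip(p, c)]
```

For budgets below $g(0)$ the returned profile spends the whole budget and keeps every coordinate strictly between 0 and $p_k / 2$, in line with Lemma 1 and equation (6).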


3.2 Thickness $\mu(B)$ of $G_\Gamma(B)$ and Sandwiching


Our next step is to use the analytical solution to the optimization problem to instantiate the thickness parameter $\mu$ defined in (4). Using (6), we can write:

$$\mu(B) = \min_{k \in [K]} m^*_k = n \cdot \min_{k \in [K]} \frac{p_k}{1 + \exp[\lambda(B) c_k]} \qquad (10)$$

where we have used the facts that $a^*_k = m^*_k / n$ and $a^*_k(B) \le p_k / 2 \Rightarrow m^*_k \le P_k - m^*_k$. To get a more convenient expression, since $0 < c_k < \infty$ we can write the cost as $c_k = \frac{1}{\beta_k} \log(p_k)$ where $0 < \beta_k < \infty$ when $p_k > 1$. Thus, approximately³ for large $p_k$ (equivalently, large $k$) we have

$$\mu(B) \approx n \cdot \min_{k \in [K]} p_k^{\,1 - \lambda(B)/\beta_k}.$$

Theorem 6 gives strong (non-constant) probability bounds as long as $\tau(B) \ll 1$. For our purposes we are going to consider that the maximum $\tau(B)$ (respectively minimum $B$) that we allow is $\tau_0 = \log^{-1}(n)$ (respectively $B_0$). Substituting the above expression for $\mu(B)$ in (5), we get that the condition $\tau \le \tau_0$ can be rewritten as $\lambda(B) \le \lambda_0$, where

$$\lambda_0 = \lambda_0(\{p_k\}, \{\beta_k\}) := \min_{k \in [K]} \beta_k \, \frac{\log\left( \frac{n\, p_k}{5 K \log^2(n)} \right)}{\log p_k} \qquad (11)$$

Using the function $g(\lambda)$ defined in (9), we can express this constraint as $B \ge B_0 := g(\lambda_0)$.

To conclude the proof of Theorem 3 we see that $\mu(B) \ge 5 K \log^2(n)$ and $\tau(B) \le \tau_0$, for all $B \ge B_0$. Applying Theorem 6, for $\epsilon_0 = \sqrt{24 / \log n}$ that is greater than $\sqrt{12 \tau_0}$, we get that $\delta \le 2 \exp\left[-\mu(B)\left(\frac{\epsilon_0^2}{12} - \tau(B)\right)\right]$. The proof is concluded by substituting the bounds in the last expression.

4 Navigability via Reducibility 

In this section we prove our results about navigability on coherent geometries. We start by giving a slightly 
more formal definition of coherence. Recall that given a geometry $(V, d)$ and a fixed (scale factor) $\gamma > 1$, $P_k(v)$ denotes the number of vertices in $V$ at “distance” $(\gamma^{k-1}, \gamma^k]$ from $v$. Further, for fixed $\lambda < 1$ and all $t \ne v \in V$, let $k_{vt}$ be the non-negative integer such that $d(v, t) \in (\gamma^{k_{vt}-1}, \gamma^{k_{vt}}]$ and $D_\lambda(v, t)$ be the vertices in $V$ whose distance from $v$ is at most $\gamma^{k_{vt}}$ and whose distance from $t$ is at most $\lambda \cdot d(v, t)$. Thus, $|D_\lambda(v, t)|$ is the number of nodes that could facilitate greedy routing ($t$-helpful), i.e., reduce the distance to $t$ by a constant factor $\lambda < 1$.

Coherence. Fix $\gamma > 1$ and let $K = \lceil \log_\gamma(|V|) \rceil$. A geometry $(V, d)$ is $\gamma$-coherent if:

(H1) Bounded Growth: $\exists A > 1, \alpha > 0$ such that $P_k(v) \in \gamma^k [\alpha, A]$, for all $v \in V$ and $k \in [K]$.

(H2) Isotropy: $\exists \phi > 0, 1 > \lambda > 0$ such that $|D_\lambda(v, t)| \ge \phi \gamma^{k_{vt}}$, for all $v \ne t \in V$.

³When the approximation does not hold it means that $\mu(B) = O(n)$ which trivially satisfies all the requirements we need for
“sandwiching” and navigability.
For graphs on coherent geometries there are two requirements for navigability. The first basic requirement is deterministic and amounts to the ability to move slowly (linear rate) towards the target. In the graph augmentation setting this was given by the fact that the initial set of edges formed a connected graph. On the other hand, in Kleinberg’s work on set systems, the degree of vertices is set to $\Theta(\log^2(n))$, so that the probability of ever being stuck at a vertex is polynomially small. As mentioned in the introduction, we opt to adopt the more natural approach of assuming a substrate.

Substrate. A set of edges $E_0$ forms a substrate for a geometry $(V, d)$ if for every $(s,t) \in V \times V$ with
$s \neq t$, there is at least one vertex $v$ such that $\{s,v\} \in E_0$ and $d(v,t) \le d(s,t) - 1$. If there are
multiple such vertices, we distinguish one arbitrarily and call it the local $t$-connection of $s$. A path starting
from $s$ and ending at $t$ using only local $t$-connections is called a local $(s,t)$-path.
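
As an illustration of the substrate definition, the following minimal sketch assumes a toy geometry that is not taken from the paper: $n$ vertices on a cycle with the hop distance, and the cycle edges playing the role of the substrate. The helper names (`ring_distance`, `local_connection`, `local_path`) are hypothetical.

```python
def ring_distance(u: int, v: int, n: int) -> int:
    """Hop distance between u and v on a cycle with n vertices (toy geometry)."""
    diff = abs(u - v) % n
    return min(diff, n - diff)


def local_connection(s: int, t: int, n: int) -> int:
    """Return one substrate neighbour v of s with d(v, t) <= d(s, t) - 1.

    Assumption: the substrate consists of the cycle edges {i, i+1 mod n}.
    """
    for v in ((s + 1) % n, (s - 1) % n):
        if ring_distance(v, t, n) <= ring_distance(s, t, n) - 1:
            return v
    raise ValueError("no local connection exists (is s == t?)")


def local_path(s: int, t: int, n: int) -> list:
    """Follow local t-connections from s until t is reached."""
    path = [s]
    while path[-1] != t:
        path.append(local_connection(path[-1], t, n))
    return path
```

On this toy instance every local $(s,t)$-path has exactly $d(s,t)$ edges, which is the slow, linear-rate progress that the substrate is meant to guarantee.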


The second requirement is probabilistic and expresses the fact that for all distance scales and “directions” 
there should be significant probability of observing an edge. This property is satisfied by Rank Based 
Augmentation and is essentially what was actually used to prove navigability originally. 


Uniform Richness. Given a $\gamma$-coherent geometry $(V, d)$ with parameters $a, \phi > 0$, define
$k_\theta := \theta \log_\gamma \log n$ to be the distance scale of edges having distance $\Theta(\log^\theta(n))$.
A product measure $G(n, Q)$ is then called $\theta$-uniformly rich for $(V, d)$ if there is a constant $M > 0$ such
that for every $k \ge k_\theta$ every edge $(i,j)$ with $d(i,j) \in (\gamma^{k-1}, \gamma^k]$ satisfies
$Q_{ij} \ge \frac{1}{M \log^\theta(n)\, \gamma^k}$.


In other words, since we are interested in routing in poly-logarithmic time and slow traveling can be done 
through the substrate (connected base graph), the probabilistic requirements concern only edges of longer 
distance. As we show next these two requirements are sufficient for navigability to arise in the general 
setting of random graphs of bounded cost. 


4.1 Reducibility via Uniform Richness 

We start by introducing a deterministic property of graphs that implies navigability, that of reducibility. The 
main advantage of reducibility is that it allows us to separate the construction of the random graph from the 
analysis of the algorithm. 

Reducibility. Given a graph $G(V,E)$, we will say that a pair $(s,t) \in V \times V$ is $p$-reducible if $\exists C > 0$
such that among the first $C(\log |V|)^p$ vertices of the local $(s,t)$-path there is at least one vertex $u$ such
that $(u,v) \in E$ and $d(v,t) \le \lambda\, d(s,t)$. If every pair $(s,t) \in V \times V$ is $p$-reducible we will say
that $G$ is $p$-reducible.

Proposition 1. If $G$ is $p$-reducible, greedy routing on $G$ takes at most $1 + C(\log n)^{1+p}$ steps.

Proof of Proposition 1. Given any arbitrary pair of vertices $(s,t)$ with distance at most $n$, the reducibility
property of $G$ guarantees that after at most $C \log^p n$ steps we obtain a new pair $(s', t)$ with distance
reduced by a constant factor. Since the new pair is also $p$-reducible, we can repeat the process until we
reduce the distance again by a constant factor. After at most $\log_{1/\lambda} n$ iterations we reach the target.
Since the pairs were arbitrary, this holds for all pairs and thus the graph is navigable in
$1 + C(\log n)^{1+p}$ steps. □
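
The counting argument in the proof can be sketched numerically. The function below is a hypothetical helper, not part of the paper; it assumes natural logarithms, an initial distance of at most $n$, and one extra hop for the final step once the bound on the remaining distance has dropped to 1.

```python
import math


def greedy_step_bound(n: int, C: float, p: float, lam: float) -> int:
    """Step bound suggested by the proof: (number of phases) x (steps per phase) + 1.

    Assumptions (ours): natural logarithms, distances start at most n, and a
    phase ends as soon as the distance has shrunk by the factor lam < 1.
    """
    phases = 0
    remaining = float(n)
    while remaining > 1.0:      # each phase shrinks the distance bound by lam
        remaining *= lam
        phases += 1
    steps_per_phase = math.ceil(C * math.log(n) ** p)
    return 1 + phases * steps_per_phase
```

For instance, with $n = 1000$, $C = 1$, $p = 1$ and $\lambda = 1/2$ the sketch counts 10 phases of 7 steps each, i.e. a bound of 71 steps, far below the worst-case 1000 hops along a substrate alone.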

Lemma 3. Given a $\gamma$-coherent geometry $(V, d)$ with a substrate $E_0$ and a random edge set $E_Q$ sampled
from a $\theta$-uniformly rich product measure $G(n, Q)$, the graph $G(V, E_0 \cup E_Q)$ is $(\theta + 1)$-reducible
with high probability.
Proof. To prove that the graph is $(\theta + 1)$-reducible we will (i) prove that the event $B_{st}$ that any fixed
source-destination pair $(s,t)$ is not $(\theta + 1)$-reducible has very small probability under $G(n, Q)$, and (ii) use
a union bound to argue that the probability that any pair is not $(\theta + 1)$-reducible is small as well. To simplify
the proof, we first distinguish between pairs $(s,t)$ where within the first $C \log^{\theta+1}(n)$ steps of the local
$(s,t)$-path there is a vertex with distance smaller than $d(s,t)$ by a constant factor $\lambda < 1$ and pairs where
there is no such vertex. Pairs $(s,t)$ that belong to the first case are $(\theta + 1)$-reducible with probability 1.
Hence, we only need to focus on the latter case, where all vertices on the first $C \log^{\theta+1}(n)$ steps are within
the same distance scale $k_{st} := \lceil \log_\gamma d(s,t) \rceil$ as $s$ from $t$. We will refer to $k_{st}$ as $k$ to ease
the notation. For each such vertex $v$ on the local $(s,t)$-path, property (H2) of coherent geometries tells us that
there are at least $\phi \gamma^k$ candidate edges that would reduce the distance from $t$ by a constant factor
$\lambda < 1$. The probability $Q_{vz}$ of each such good edge $(v,z)$ is lower bounded by
$\frac{1}{M \log^\theta(n)\, \gamma^k}$, since the measure $G(n, Q)$ is $\theta$-uniformly rich. Let $T(s,t)$ be the set of
all such good edges. We can write the probability of the event $B_{st}$ as:


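$$P_Q(B_{st}) = \prod_{e \in T(s,t)} (1 - Q_e) \le \left(1 - \frac{1}{M \log^\theta(n)\, \gamma^k}\right)^{|T(s,t)|} \le \exp\left(-\frac{C \log^{\theta+1}(n)\, \phi \gamma^k}{M \log^\theta(n)\, \gamma^k}\right) \le n^{-C\phi/M}$$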


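where we used that $|T(s,t)| \ge C \log^{\theta+1}(n) \cdot \phi \gamma^k$ due to (H2) and the definition of reducibility.
For any $\varepsilon > 0$ and $C \ge (2 + \varepsilon) M / \phi$ we get that $P(B_{st}) \le n^{-(2+\varepsilon)}$. To finish
the proof, we perform a union bound over all possible pairs $(s,t)$. Let $B$ be the event that the graph
$G(V, E_0 \cup E_Q)$ is not $(\theta + 1)$-reducible, then:

$$P_Q(B) = P_Q\Big(\bigcup_{s,t} B_{st}\Big) \le \sum_{s,t} P_Q(B_{st}) \le n^2\, n^{-(2+\varepsilon)} = n^{-\varepsilon}$$

for any $\varepsilon > 0$. Thus, the graph $G(V, E_0 \cup E_Q)$ is $(\theta + 1)$-reducible, and hence by Proposition 1
$d$-navigable, with high probability. □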
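
The chain of inequalities in this proof can be checked numerically. The sketch below is only an illustration under stated assumptions: every good edge is given exactly the lower-bound probability from uniform richness, logarithms are natural, and the function names and parameter values are hypothetical.

```python
import math


def min_constant(eps: float, M: float, phi: float) -> float:
    """Smallest constant C used in the proof: C = (2 + eps) * M / phi."""
    return (2.0 + eps) * M / phi


def failure_chain(n, theta, gamma, k, M, phi, C):
    """Return ((1-q)^T, exp(-q*T), n^(-C*phi/M)) for one pair (s, t).

    q is the richness lower bound on a single good edge and T a lower
    bound on the number of good edges along the local path.
    """
    log_n = math.log(n)
    q = 1.0 / (M * log_n ** theta * gamma ** k)
    T = math.ceil(C * log_n ** (theta + 1) * phi * gamma ** k)
    return (1.0 - q) ** T, math.exp(-q * T), n ** (-C * phi / M)


def union_bound(n, C, M, phi):
    """Union bound over at most n^2 pairs, each failing w.p. at most n^(-C*phi/M)."""
    return n ** 2 * n ** (-C * phi / M)
```

With $\varepsilon = 1$, $M = 2$ and $\phi = 1/2$ the sketch gives $C = 12$, so each pair fails with probability at most $n^{-3}$ and the union bound over all pairs is $n^{-1}$.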


4.2 Analyzing the Product Measure $G(n, Q^*(B))$

Our next step will be to show that for a range of values of $B$, the product measure defined through (6) is
$\theta$-uniformly rich for some $\theta > 0$. In doing so, our previous result shows that such a product measure leads
to navigable graphs. Recall that $Q^*(B)$ is the matrix where for all $k \in [K]$ and $ij \in V_k$ it holds that
$Q^*_{ij} = (1 + \exp(\lambda(B) c_k))^{-1}$ and $g(\lambda(B))$ is the expected budget corresponding to an element
generated according to the product measure $Q^*(B)$.


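Proposition 2. For $B \ge B_\theta^- := \max\{B_0, g(\lambda_\theta^-)\}$, the product measure $G(n, Q^*(B))$ is
$\theta$-uniformly rich. The number $\lambda_\theta^-$ is explicitly defined as

$$\lambda_\theta^-(\{p_k\}, \{c_k\}) := \min_{k_\theta \le k \le K} \frac{\log p_k}{c_k} \left(1 + \theta\, \frac{\log\log n}{\log p_k}\right).$$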


Proof. This follows easily by the definition of $\lambda_\theta^-$. In particular, consider an edge $(i,j)$ of scale
$k \ge k_\theta$:

$$Q_{ij}(B) = [1 + \exp(c_k \lambda(B))]^{-1} \ge \frac{1}{p_k \log^\theta(n)} \ge \frac{1}{A \log^\theta(n)\, \gamma^k}$$

where the last inequality follows from (H1). □


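Proposition 3. For $B \le B_\theta^+ := g(\lambda_\theta^+)$ the product measure $G(n, Q^*(B))$ has
$O(n \cdot \log^{\theta+1}(n))$ edges with high probability. The number $\lambda_\theta^+$ is explicitly defined as

$$\lambda_\theta^+(\{p_k\}, \{c_k\}) := \max_{k_\theta \le k \le K} \frac{\log p_k}{c_k} \left(1 - \theta\, \frac{\log\log n}{\log p_k}\right).$$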
Proof. For all $B_\theta^- \le B \le B_\theta^+$, by definition of $\lambda_\theta^+$ we have that for all $k \ge k_\theta$:

$$Q_{ij}(B) = [1 + \exp(c_k \lambda(B))]^{-1} \le \frac{\log^\theta(n)}{p_k}.$$

Thus, the expected number of edges is upper bounded by:

$$n \left[ k_\theta \cdot p_{k_\theta} + (K - k_\theta) \max_{k \ge k_\theta} p_k \cdot \frac{\log^\theta(n)}{p_k} \right] = n \cdot O\left(\log\log(n) \log^\theta(n) + \log(n) \log^\theta(n)\right)$$

as $k_\theta = O(\log\log n)$, $p_{k_\theta} = O(\log^\theta(n))$ by (H1) and $K = O(\log n)$. Applying standard Chernoff
bounds [3] we get the required conclusion, as by definition for $B \ge B_\theta^-$ each class has at least a
poly-logarithmic number of edges at the maximizer and thus the number of edges (under the product measure) is
tightly concentrated around its mean. □


4.3 Analyzing Graphs of Bounded Cost 

Proof of Theorem 4. For any $B \ge B_0$, consider $Q^*(B)$, the matrix corresponding to the optimal profile
(Lemma 2), and two random elements $E^\pm \sim G(n, (1 \pm \epsilon) Q^*(B))$. By Theorem 3, we get that for
$\epsilon = \sqrt{24/\log(n)}$ the probability of the event $W$, i.e. that $E^- \subseteq E_C \subseteq E^+$, is at least
$1 - n^{-5K}$. To prove Theorem 4 we will condition on the above event and then use our analysis of the product
measure. To prove navigability we will use the relation $E^- \subseteq E_C$ and the fact that navigability is a
monotone property. Let $N_d(E)$ be the event that the graph $G(V, E_0 \cup E)$ is not $d$-navigable, then:

$$
\begin{aligned}
P(N_d(E_C)) &= P(N_d(E_C) \cap W) + P(N_d(E_C) \cap \overline{W}) && (12) \\
&\le P(N_d(E_C) \mid W) + P(\overline{W}) && (13) \\
&\le P_{Q^*}(N_d(E^-)) + n^{-5K} && (14) \\
&\le n^{-\varepsilon} + n^{-5K} && (15)
\end{aligned}
$$

where we used the law of total probability in the first equality, Bayes' Theorem and monotonicity of the
probability measure in the second inequality, Theorem 3 and monotonicity in the third, and Lemma 3 and
Proposition 2 in the last. This proves part (a) of the theorem. To prove part (b) we follow the same method
but for the event $\{|E_C| = \omega(n\, \mathrm{poly}(\log(n)))\}$ and exploit the inclusion $E_C \subseteq E^+$. Using
Proposition 3 and Theorem 3 we get the required conclusion. □


4.4 Analysis of Indexing 


Proof of Theorem 5. We start with the proof of part 3 of the Theorem. Instead of considering $c_k^* \propto k$
we can equivalently consider, due to (H1), $c_k^* \propto \log p_k$. Thus, for simplicity, let
$c_k^* = \frac{1}{\alpha} \log p_k$. Set $B_\alpha = g(\alpha)$; for such $B$ and an edge $(u,v)$ of scale $k$, we have

$$Q^*_{uv}(B_\alpha) = \frac{1}{1 + \exp(\lambda(B_\alpha) c_k^*)} = \frac{1}{1 + \exp(\alpha c_k^*)} = \frac{1}{1 + p_k}.$$

Now, by property (H1) we know that for any vertex $u$ and every vertex $v$ within distance scale $k$ from $u$,
$|N_u(d(u,v))| \in [a, A]\, \gamma^k$, thus we get that:

$$\frac{a}{2A} \cdot \frac{1}{|N_u(d(u,v))|} \le Q^*_{uv}(B_\alpha) \le \frac{2A}{a} \cdot \frac{1}{|N_u(d(u,v))|} \qquad (16)$$
Setting $r = 2A/a$ proves part (b). To further see the correspondence between Random Graphs of Bounded
Cost when the cost corresponds to indexing and Rank Based Augmentation, consider $a_k^*(B_\alpha)$, the average
number of edges of scale $k$ per vertex. We have:

$$a_k^*(B_\alpha) = \frac{p_k}{1 + p_k} \approx 1, \quad \forall k \in [K]$$

Thus, we see that the scale invariance property of RBA is recovered. Furthermore, we have that in this case
$B_\alpha = \sum_{k=1}^{K} a_k^*(B_\alpha) c_k^* = \Theta(\log^2(n))$ and the average degree of a random graph of
bounded cost for $B_\alpha$ is $\Theta(\log(n))$.
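
The scale-invariance claim and the quadratic growth of the budget can be illustrated with a small computation. This is a sketch under simplifying assumptions that are ours rather than the paper's: exactly $p_k = \gamma^k$ vertices at scale $k$ (that is, $a = A = 1$ in (H1)), natural logarithms, and hypothetical default values for $\gamma$ and $\alpha$.

```python
import math


def indexing_budget(n: int, gamma: float = 2.0, alpha: float = 1.0):
    """Average edges per scale and total budget for the indexing cost.

    Assumes p_k = gamma**k exactly; c_k = log(p_k) / alpha is the indexing
    cost and a_k = p_k / (1 + p_k) the average number of scale-k edges.
    """
    K = math.ceil(math.log(n) / math.log(gamma))   # number of distance scales
    avg_edges, budget = [], 0.0
    for k in range(1, K + 1):
        p_k = gamma ** k
        c_k = math.log(p_k) / alpha
        a_k = p_k / (1.0 + p_k)
        avg_edges.append(a_k)
        budget += a_k * c_k
    return avg_edges, budget
```

Under these assumptions every scale contributes between 2/3 and 1 edge per vertex, so the average degree grows like the number of scales $K$, while the budget grows like $K^2$ because the cost of a scale-$k$ edge is proportional to $k$.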

To show the first two parts of the theorem we essentially obtain estimates for $B^\pm$ given in Theorem 4
for the special case where the cost is the cost of indexing as above. We have:

$$\lambda_\theta^- = \alpha \left(1 + \theta\, \frac{\log\log n}{\log p_K}\right) \qquad (17)$$

$$\lambda_\theta^+ = \alpha \left(1 - \theta\, \frac{\log\log n}{\log p_K}\right) \qquad (18)$$

By property (H1) we know that $\log p_K = \Theta(\log n)$. Define as before $B^+ = g(\lambda_\theta^+)$ and
$B^- = g(\lambda_\theta^-)$. Then for every $B^- \le B \le B^+$, or equivalently for
$\lambda_\theta^+ \le \lambda(B) \le \lambda_\theta^-$, we have that for some $C > 0$:

$$\Omega\left((\log n)^{-C\theta \frac{k}{\log n}}\right) = \frac{p_k}{1 + \exp(\lambda(B) c_k^*)} = O\left((\log n)^{C\theta \frac{k}{\log n}}\right)$$

where $a_k^*(B) = \frac{p_k}{1 + \exp(\lambda(B) c_k^*)}$ expresses the average number of edges of scale $k$ per vertex.
Thus, by (H1) we get that:

$$B^+ = \frac{1}{\alpha} \sum_{k=1}^{K} a_k^*(B^+) \log p_k \ge \frac{1}{\alpha} \log p_K \cdot a_K^*(B^+) = \Omega\left(\log(n)^{1 + C\theta}\right)$$

Further, $B^- \le B_\alpha = \Theta(\log^2(n))$. Hence, we obtain that $B^+/B^- = \Omega(\mathrm{poly}(\log n))$. The proof is
concluded by invoking Theorem 3. □


4.5 Rank Based Augmentation for Coherent Geometries 

Recall that in RBA a single link is added for each vertex $u$ to a random vertex $v$ with probability given by

$$P_{RBA}(u,v) = \frac{1}{Z} \cdot \frac{1}{|N_u(d(u,v))|} \qquad (19)$$

where $N_u(\ell) := \{t \in V : d(u,t) \le \ell\}$ is the set of vertices that are within distance $\ell$ from $u$. Here we
show that Kleinberg’s original proof can be applied with ease when, instead of the semi-metric induced
by a set system, we have a semi-metric corresponding to a coherent geometry. There are basically two steps.
We first upper bound the normalizing constant $Z$ and then lower bound the probability that for a given pair
$(s,t)$ we find an edge in the first $C \log^2(n)$ steps of a path along the substrate that reduces the distance to $t$
by a constant factor.
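
To make the RBA distribution concrete, the sketch below evaluates the weights $1/|N_u(d(u,v))|$ and the normalizing constant on a toy geometry that is our own assumption, not the paper's: $n$ vertices on a cycle with hop distance, where the ball of radius $d$ around $u$ contains $\min(2d+1, n)$ vertices. All function names are hypothetical.

```python
import math


def ring_distance(u: int, v: int, n: int) -> int:
    """Hop distance on a cycle with n vertices (toy geometry)."""
    diff = abs(u - v) % n
    return min(diff, n - diff)


def rba_weights(u: int, n: int) -> dict:
    """Unnormalised RBA weights 1 / |N_u(d(u, v))| for every v != u."""
    weights = {}
    for v in range(n):
        if v == u:
            continue
        d = ring_distance(u, v, n)
        ball_size = min(2 * d + 1, n)   # vertices within distance d of u, u included
        weights[v] = 1.0 / ball_size
    return weights


def rba_distribution(u: int, n: int):
    """Return (probabilities, Z) where Z is the normalising constant."""
    weights = rba_weights(u, n)
    Z = sum(weights.values())
    return {v: w / Z for v, w in weights.items()}, Z
```

On this cycle the normalizing constant behaves like a harmonic sum, so it grows logarithmically in $n$, in line with the kind of bound the first step of the argument asks for.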

Proposition 4 (Bounded Growth). For a coherent geometry $(V, d)$, $\exists C < \infty$ such that $Z(1) \le C \log(n)$.
Proof. For a given vertex $u$, we divide vertices depending on their distance scale $k \in \{0, \ldots, \log_\gamma(n)\}$
from $u$. For $k > 0$, we know from property (H1) that there are at most $A\gamma^k$ such vertices. Further, we also
know that $|B_{k-1}(u)| = \sum_{i=0}^{k-1} P_i(u) \ge a\gamma^{k-1}$. Using these two facts we have:

$$Z(1) = \sum_{v \in V} \frac{1}{|N_u(d(u,v))|} \le \frac{A}{a} + \sum_{k=1}^{\log_\gamma(n)} P_k(u)\, \frac{1}{|B_{k-1}(u)|} \le \frac{A}{a} + \frac{A}{a} \sum_{k=1}^{\log_\gamma(n)} \frac{\gamma^k}{\gamma^{k-1}} = \frac{A}{a}\left(1 + \gamma \log_\gamma n\right)$$

□


Finally, to complete the proof, we are going to employ once again reducibility. 

Proof of Theorem 2. Fix any two vertices $s, t$. The probability of finding a long-range edge at $s$ reducing the
distance by a constant factor is at least:

$$\frac{|D(s,t)|}{Z} \cdot \frac{1}{P_k(s)} \ge \frac{1}{C \log n} \cdot \frac{\phi \gamma^k}{A \gamma^k} = \frac{\phi}{A C \log n}$$

Thus, the probability of the event $B_{st}$ that no such edge exists after $C' \log^2(n)$ trials is at most:

$$P(B_{st}) \le \left(1 - \frac{\phi}{A C \log n}\right)^{C' \log^2 n} \le e^{-\frac{\phi C'}{A C} \log n} \le n^{-\frac{\phi C'}{A C}}$$

For $C'$ large enough and a union bound over the $\Theta(n^2)$ possible pairs of vertices, we get that if $E_d$ is the
random set of edges added through RBA and $E_0$ is a substrate for the coherent geometry $(V, d)$, then the
graph $G(V, E_0 \cup E_d)$ is $d$-navigable with high probability. □
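
The interplay between the substrate and the RBA links can be simulated on a toy instance. The sketch below is an assumption-laden illustration, not the paper's construction: the geometry is a cycle with hop distance, the substrate is the set of cycle edges, each vertex receives one directed long-range link drawn with weight $1/\min(2d+1, n)$, and all names are hypothetical.

```python
import random


def ring_distance(u: int, v: int, n: int) -> int:
    """Hop distance on a cycle with n vertices (toy geometry)."""
    diff = abs(u - v) % n
    return min(diff, n - diff)


def sample_long_range_links(n: int, rng: random.Random) -> dict:
    """Draw one RBA-style long-range link per vertex."""
    links = {}
    for u in range(n):
        others = [v for v in range(n) if v != u]
        weights = [1.0 / min(2 * ring_distance(u, v, n) + 1, n) for v in others]
        links[u] = rng.choices(others, weights=weights, k=1)[0]
    return links


def greedy_route(s: int, t: int, n: int, links: dict) -> int:
    """Greedy routing: forward to the known neighbour closest to t; return hop count."""
    current, hops = s, 0
    while current != t:
        neighbours = [(current + 1) % n, (current - 1) % n, links[current]]
        current = min(neighbours, key=lambda v: ring_distance(v, t, n))
        hops += 1
    return hops
```

Because a cycle neighbour always reduces the distance by one, the routine terminates and never needs more hops than $d(s,t)$; the long-range links can only shorten the route, which is the mechanism the proof quantifies.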


5 Set-Systems are Coherent Geometries 

We begin by recalling the definitions of set-systems from [11]. 

Definition 1 (Set System). Let $V$ be a finite set of vertices and let $\mathcal{S} = \{S_1, \ldots, S_m\}$ be a collection of
subsets of $V$. If a set $S$ contains a vertex $t$ we will say that $S$ is $t$-bound.

Fix $0 < \lambda < 1$ and $\beta > 1$. We say that $\mathcal{S}$ is a $(\lambda, \beta)$-set system if all the following hold:

(K1) $V \in \mathcal{S}$.

(K2) If $|S| > 1$, then for every $t \in S$, there is a $t$-bound $S' \subset S$ of size $|S'| \ge \min\{\lambda|S|, |S| - 1\}$.

(K3) If $S_L(v)$ is the union of sets that contain $v$ and have size at most $L \ge 2$, then $|S_L(v)| \le \beta L$.

Given a set system $\mathcal{S}$ on a set of vertices $V$, we define the distance (semi-metric) between two vertices.

Definition 2. For any two vertices $u, v \in V$, their distance in $\mathcal{S}$, denoted by $d_{\mathcal{S}}(u, v)$, is the size of the
smallest set in $\mathcal{S}$ containing both vertices minus 1, i.e. $d_{\mathcal{S}}(u, v) = \min_{S \in \mathcal{S}}\{|S| - 1 : u, v \in S\}$.
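
Definition 2 can be evaluated directly on a small example. The sketch below uses a toy hierarchy chosen for illustration only (all dyadic intervals of $\{0, \ldots, 2^m - 1\}$), which one can check satisfies (K1) to (K3) with $\lambda = 1/2$ and any $\beta > 1$; the function names are hypothetical.

```python
def dyadic_set_system(m: int) -> list:
    """All dyadic intervals of {0, ..., 2**m - 1}, from the whole set down to singletons."""
    n = 2 ** m
    sets = []
    size = n
    while size >= 1:
        for start in range(0, n, size):
            sets.append(frozenset(range(start, start + size)))
        size //= 2
    return sets


def set_system_distance(u: int, v: int, sets: list) -> int:
    """Distance of Definition 2: size of the smallest set containing u and v, minus 1."""
    return min(len(S) - 1 for S in sets if u in S and v in S)
```

For $m = 3$, vertices 0 and 1 share a set of size 2 and are at distance 1, whereas vertices 3 and 4 only meet in the whole vertex set and are at distance 7, which shows how the distance reflects the hierarchy rather than the numerical labels.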

The goal of this section is to show that the geometry $(V, d_{\mathcal{S}})$ is coherent for any $(\lambda, \beta)$-set system,
i.e., prove that the semi-metric $d_{\mathcal{S}}$ satisfies properties (H1) and (H2) for a suitable $\gamma > 1$. Towards that
direction, the main hurdle is obtaining for all $v \in V$ upper and lower bounds on $P_k(v)$, the number of
vertices at distance in $(\gamma^{k-1}, \gamma^k]$ from $v$. The basic observation that guides the proof is that for all $v$
and $k \ge 1$

$$P_k(v) = |B_k(v)| - |B_{k-1}(v)| \qquad (20)$$

where $B_k(v)$ is the set of all vertices having distance from $v$ at most $\gamma^k$. This representation is very
convenient because the properties of set systems are directly related to $|B_k(v)|$. In particular, if we get good
upper and lower bounds for $|B_k(v)|$ then we can obtain upper and lower bounds for $P_k(v)$ and prove (H1), which
comprises the main challenge.

Obtaining the upper bound is trivial, since it is directly given by (K3). However, the lower bound on
$|B_k(v)|$ requires more thought as it needs to be tight enough so that when substituting both bounds in (20)
(in order to obtain a lower bound on $P_k(v)$) the difference is strictly positive. It turns out that the last
property depends on the particular values of the parameters $\lambda, \beta$. We show that it is always possible to select
$\gamma = \gamma(\beta, \lambda) > 1$ such that the last property holds. The main observation that will provide a lower bound
on $|B_k(v)|$ is that the existence of a set $S$ with size in $(\gamma^{k-1}, \gamma^k]$ implies that $|B_k(v)| \ge |S|$ for all
$v \in S$. This is because all vertices in $S$ have distance at most $|S| - 1$ from $v$. Thus, what remains is to show
the existence of such a set $S$ for all $v \in V$ and $k$. To that end, we need the following auxiliary lemma that was
implicitly stated and used in Kleinberg’s original work [11].

Proposition 5 (Shrinkage). For every $S \in \mathcal{S}$ with $|S| > 1/(\lambda - \lambda^2)$ and for every $t \in S$, there exists a
$t$-bound set $S' \in \mathcal{S}$ with $\lambda^2|S| \le |S'| \le \lambda|S|$.

Proof of Proposition 5. Assume, for the sake of contradiction, that there exists a set $S$ and a vertex $t \in S$
such that the proposition does not hold. If we start with $S$ and invoke (K2) iteratively until we reach $t$, we get
a sequence $S = S_1 \supset S_2 \supset \cdots \supset S_k = \{t\}$ of $t$-bound subsets of $S$. Since $|S| > \lambda|S|$, there is a
largest index $i$ such that $|S_i| > \lambda|S|$, and $|S_i| \ge 2$ since $\lambda|S| > 1$. Therefore, we can apply (K2) to $S_i$,
yielding a $t$-bound set of size at least $z = \min\{\lambda|S_i|, |S_i| - 1\}$. For the hypothesis to hold it must be that
$z < \lambda^2|S|$, for if $z > \lambda|S|$ we contradict the maximality of $i$. But having $z < \lambda^2|S|$ is impossible since
the fact $|S_i| > \lambda|S|$ implies $\lambda|S_i| > \lambda^2|S|$, while combined with the fact $|S| > 1/(\lambda - \lambda^2)$ it implies
$|S_i| - 1 > \lambda^2|S|$. □

This lemma will be used to show that for all vertices $v$ one can start from the set $V$, which belongs to $\mathcal{S}$
by (K1), and inductively apply Proposition 5 to deduce the existence of sets $S$ containing $v$ at all scales. More
specifically, given a $(\lambda, \beta)$-set system $\mathcal{S}$, let $M$ be the smallest integer such that $\lambda^{-2M} \ge |V|$. We
partition the range of possible set-sizes in $\mathcal{S}$ as $\mathcal{I} = (I_1, \ldots, I_M)$ by letting
$I_k = (\lambda^{-2(k-1)}, \lambda^{-2k}]$, for $k \in [M]$. The partition $\mathcal{I}$ implicitly partitions all pairs of vertices into
groups, such that all pairs in a group have roughly the same distance in $\mathcal{S}$, i.e., up to a factor of $\lambda^2$. We
show that for every vertex and for every interval of the partition, there is a set with size in that interval
that contains the vertex.

Proposition 6. For every $t \in V$, for every $k \in [M]$, there exists a $t$-bound set $S \in \mathcal{S}$ with $|S| \in I_k$.

Proof of Proposition 6. Assume, for the sake of contradiction, that there exists a vertex $t$ for which the
proposition does not hold. Let $k_0 \in [M]$ be the largest integer such that there is no $t$-bound set $S' \in \mathcal{S}$ with
$|S'| \in I_{k_0}$. If we start with $V$ and invoke (K2) iteratively until we reach $t$, we get a sequence
$V = S_1 \supset S_2 \supset \cdots \supset S_k = \{t\}$ of $t$-bound sets. Let $i_{k_0}$ be the largest index $i$ such that
$|S_i| \in I_{k_0+1}$. The maximality of $k_0$ implies $|S_{i_{k_0}+1}| \in I_{k_0-1}$. But invoking Proposition 5 for
$S_{i_{k_0}}$ implies $|S_{i_{k_0}+1}| \in I_{k_0}$, a contradiction. □

Treating $\mathcal{I}$ as a distance scale, our next goal is to obtain for each vertex $t$ upper and lower bounds on
the number of vertices that lie at each distance scale from $t$. To achieve this we need to consider a coarser
partition of the set sizes than $\mathcal{I}$. To do that it will be beneficial to use a partition built out of blocks of
$\mathcal{I}$, thus allowing us to utilize Proposition 6, proven for $\mathcal{I}$. In particular, the existence of a $t$-bound set of
each size will be the basis for obtaining lower bounds on the number of vertices at each new distance scale from $t$.

We let $r = r(\beta, \lambda) \ge 2$ denote the smallest integer such that $\lambda^{-2(r-1)} > \beta$ and consider the partition
that results by grouping together every $r$ consecutive intervals of $\mathcal{I}$. That is, for
$\gamma(\beta, \lambda) = \lambda^{-2r(\beta, \lambda)}$, we define the partition $\mathcal{A} = \mathcal{A}(\gamma)$ consisting of the intervals
$\mathcal{A}_k = (\gamma^{k-1}, \gamma^k]$, $k \in [K]$, where $K$ is the smallest integer such that $\gamma^K \ge |V| - 1$. Having
defined $\mathcal{A}$, we now let $P_k(v)$ denote the number of vertices whose distance from $v$ lies in the set $\mathcal{A}_k$ and
we let $P_k = \frac{1}{2} \sum_{v \in V} P_k(v)$ denote the total number of pairs of vertices whose distance lies in $\mathcal{A}_k$.

Lemma 4 (Bounded Growth). Let $a = (\lambda^2 - \beta/\gamma) > 0$ and $A = (\beta - \lambda^2/\gamma)$. For all $k \in [K]$ and $v \in V$,

$$a \cdot \gamma^k \le P_k(v) \le A \cdot \gamma^k.$$

Proof of Lemma 4. First observe that $\mathcal{A}$ is a coarsening of $\mathcal{I}$ since $\gamma = \lambda^{-2r}$ and $r \ge 2$ is an integer.
Next, let $B_k(v) = \sum_{i \le k} P_i(v)$ be the number of vertices in $V$ whose distance from $v$ lies in
$\mathcal{A}_1 \cup \cdots \cup \mathcal{A}_k$, i.e., is no more than $\gamma^k$. Condition (K3) asserts that $B_k(v) \le \beta\gamma^k$. On the other
hand, by Proposition 6, we know that for any $v \in V$ there is a $v$-bound set $S$ with $|S| \in I_{rk} \subset \mathcal{A}_k$. Since
all vertices in $S$ have distance at most $|S| \le \gamma^k$ from $v$, we get that
$B_k(v) \ge |S| \ge \lambda^{-2(rk-1)} = \gamma^k \lambda^2$. Therefore, for all $k \in [K]$,

$$\lambda^2 \gamma^k \le B_k(v) \le \beta \gamma^k. \qquad (21)$$

Using the representation (20) and invoking (21), we get

$$\lambda^2 \gamma^k - \beta \gamma^{k-1} \le P_k(v) \le \beta \gamma^k - \lambda^2 \gamma^{k-1}$$

which is equivalent to the claimed statement. The fact $a > 0$ is implied by our choice of $\gamma$. □
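
The choice of $r$, $\gamma$ and the resulting constants of Lemma 4 can be computed mechanically from $(\lambda, \beta)$. The helper below is a hypothetical sketch of that computation; it only restates the formulas above and assumes $0 < \lambda < 1$ and $\beta > 1$.

```python
def coherence_parameters(lam: float, beta: float):
    """Return (r, gamma, a, A) for a (lam, beta)-set system.

    r is the smallest integer >= 2 with lam**(-2*(r-1)) > beta,
    gamma = lam**(-2*r), a = lam**2 - beta/gamma, A = beta - lam**2/gamma.
    """
    r = 2
    while lam ** (-2 * (r - 1)) <= beta:
        r += 1
    gamma = lam ** (-2 * r)
    a = lam ** 2 - beta / gamma
    A = beta - lam ** 2 / gamma
    return r, gamma, a, A
```

For example, $\lambda = 1/2$ and $\beta = 2$ give $r = 2$, $\gamma = 16$, $a = 0.125$ and $A = 1.984375$; the loop condition is exactly what makes $a$ strictly positive.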

Thus we have shown property (H1). Proceeding further, we need to show that the semi-metric $d_{\mathcal{S}}$ also
satisfies the isotropy property (Section 4), i.e. that the size of the set
$D_\lambda(s,t) = \{v \in V : d(s,v) \le \gamma^{k_{st}} \text{ and } d(v,t) \le \lambda\, d(s,t)\}$ is proportional to $\gamma^{k_{st}}$,
where $k_{st}$ is the scale of $d(s,t)$. To do that we are going to show something stronger. Given any two vertices
$s \neq t \in V$, consider a set $S_{st} \in \mathcal{S}$ of minimal size such that both $s, t \in S_{st}$. Then for all $k \le k_{st}$
define the set $G_k(s,t) = \{v \in S_{st} : d(s,v) \in \mathcal{A}_k \text{ and } d(v,t) \le \lambda|S_{st}|\}$ of vertices in $S_{st}$
whose distance from $s$ lies in the interval $\mathcal{A}_k$ (scale $k$) and whose distance from $t$ is no more than
$\lambda|S_{st}|$.

Lemma 5 (Isotropy). For every $s \neq t \in V$ with $|S_{st}| > 1/(\lambda - \lambda^2)$, we have that

$$|G_{k_{st}}(s,t) \cup G_{k_{st}-1}(s,t)| \ge \lambda^2 \gamma^{k_{st}-1} - \beta \gamma^{k_{st}-2}.$$

Proof of Lemma 5. Proposition 5 implies that there is a $t$-bound set $S' \in \mathcal{S}$ with
$\lambda^2|S_{st}| \le |S'| \le \lambda|S_{st}|$. Thus, a $\lambda^2$ fraction of the vertices in $S_{st}$ have distance from $t$ at
least a factor $\lambda$ less than $|S_{st}|$. Having established an abundance of “good” vertices in $S_{st}$, we are left to
show that a constant fraction of them are in the top two distance scales $k_{st}, k_{st} - 1$ from $s$ (recall that
$|S_{st}| \in \mathcal{A}_{k_{st}}$). We start by noting that $Z := \sum_{i \le k_{st}} |G_i(s,t)| \ge |S'|$, as the sum must count the
vertices in $S'$. Since $|S_{st}| > \gamma^{k_{st}-1}$ and $|S'| \ge \lambda^2|S_{st}|$, we get $Z \ge \lambda^2 \gamma^{k_{st}-1}$. On the other
hand, the good vertices in the bottom $k_{st} - 2$ distance scales from $s$ are a subset of all vertices at those
distance scales from $s$, a quantity bounded by (K3) as $\sum_{i \le k_{st}-2} |G_i(s,t)| \le \beta \gamma^{k_{st}-2}$. Therefore,
$|G_{k_{st}}(s,t) \cup G_{k_{st}-1}(s,t)| \ge \lambda^2 \gamma^{k_{st}-1} - \beta \gamma^{k_{st}-2}$. □
Proof of Theorem 1. In order to prove that the set system defines a coherent geometry, we need to show that
properties (H1) and (H2) hold for some $\gamma > 1$. Our two lemmas achieve exactly that. The first property
follows from Lemma 4 and the second property follows from Lemma 5 since
$G_{k_{st}}(s,t) \cup G_{k_{st}-1}(s,t) \subseteq D_\lambda(s,t)$. □